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Abstract: In genetic studies of complex diseases, the underlying mode of in- 
heritance is often not known. Thus, the most powerful test or other optimal 
procedure for one model, e.g. recessive, may be quite inefficient if another 
model, e.g. dominant, describes the inheritance process. Rather than choose 
among the procedures that are optimal for a particular model, it is preferable 
to see a method that has high efficiency across a family of scientifically realis- 
tic models. Statisticians well recognize that this situation is analogous to the 
selection of an estimator of location when the form of the underlying distribu- 
tion is not known. We review how the concepts and techniques in the efficiency 
robustness literature that are used to obtain efficiency robust estimators and 
rank tests can be adapted for the analysis of genetic data. In particular, several 
statistics have been used to test for a genetic association between a disease and 
a candidate allele or marker allele from data collected in case-control studies. 
Each of them is optimal for a specific inheritance model and we describe and 
compare several robust methods. The most suitable robust test depends some- 
what on the range of plausible genetic models. When little is known about 
the inheritance process, the maximum of the optimal statistics for the extreme 
models and an intermediate one is usually the preferred choice. Sometimes one 
can eliminate a mode of inheritance, e.g. from prior studies of family pedigrees 
one may know whether the disease skips generations or not. If it does, the 
disease is much more likely to follow a recessive model than a dominant one. 
In that case, a simpler linear combination of the optimal tests for the extreme 
models can be a robust choice. 



1. Introduction 

For hypothesis testing problems when the model generating the data is known, 
optimal test statistics can be derived. In practice, however, the precise form of the 
underlying model is often unknown. Based on prior scientific knowledge a family 
of possible models is often available. For each model in the family an optimal 
test statistic is obtained. Hence, we have a collection of optimal test statistics 
corresponding to each member of the family of scientifically plausible models and 
need to select one statistic from them or create a robust one, that combines them. 
Since using any single optimal test in the collection typically results in a substantial 
loss of efficiency or power when another model is the true one, a robust procedure 
with reasonable power over the entire family is preferable in practice. 
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The above situation occurs in many applications. For example, in survival anal- 
ysis Harrington and Fleming [14] introduced a family of statistics C. The family 
includes the log-rank test (p = 0) that is optimal under the proportional hazards 
model and the Peto-Peto test {p — 1, corresponding to the Wilcoxon test without 
censoring) that is optimal under a logistic shift model. In practice, when the model 
is unknown, one may apply both tests to survival data. It is difficult to draw scien- 
tific conclusions when one of the tests is significant and the other is not. Choosing 
the significant test after one has applied both tests to the data increases the Type 
I error. A second example is testing for an association between a disease and a risk 
factor in contingency tables. If the risk factor is a categorical variable and has a 
natural order, e.g., number of packs of cigarette smoking per day, the Cochran- 
Armitage trend test is typically used. (Cochran [3] and Armitage To apply 
such a trend test, increasing scores as values of a covariate have to be assigned to 
each category of the risk factor. Thus, the p-value of the trend test may depend 
on such scores. A collection of trend tests is formed by choosing various increasing 
scores. (Graubard and Korn [l^ ) A third example arises in genetic linkage and 
association studies. In linkage analysis to map quantitative trait loci using affected 
sib pairs, optimal tests are functions of the number of alleles shared identical-by- 
descent (IBD) by the two sibs. The IBD probabilities form a family of alternatives 
which are determined by g enetic models. See, e.g., Whittemore and Tu and 



Gastwirth and Freidlin [10|. In genetic association studies using case-parents trios. 



the optimal test depends on the mode of inheritance of the disease (recessive, dom- 
inant, or co-dominant disease). For complex diseases, the underlying genetic model 
is often not known. Using a single optimal test does not protect against a sub- 
stantial loss of power under the worst situation, i.e., when a much different genetic 
model is the true one. (Zheng, Freidlin and Gastwirth [25]) 

Robust procedures have been developed and applied when the underlying model 
is unknown as discussed in Gastwirth 0, HHIj Birnbaum and Laska [2], Podgor, 
Gastwirth and Mehta [13], and Freidlin, Podgor and Gastwirth 5]. In this article, 
we review two useful robust tests. The first one is a linear combination of the two or 
three extreme optimal tests in a family of optimal statistics and the second one is 
a suitably chosen maximum statistic, i.e., the maximum of several of the optimum 
tests for specific models in the family. These two robust procedures are applied to 
genetic association using case-control studies and compared to other test statistics 
that are used in practice. 



2. Robust procedures: A short review 



Suppose we have a collection of alternative models {Mi, i G /} and the correspond- 
ing optimal (most powerful) test statistics {Ti : i G /} are obtained, where / can be 
a finite set or an interval. Under the null hypothesis, assume that each of these test 
statistics is asymptotically normally distributed, i.e., Zi = [T^ — E(Ti)]/{Var(Ti)}^/^ 
converges in law to A^(0, 1) where E(Ti) and Var(rj) are the mean and the variance 
of Ti under the null; suppose also that for any i,j G I, Zi and Zj are jointly normal 
with the correlation pij. When Mi is the true model, the optimal test Zi would 
be used. When the true model Mi is unknown and the test Zj is used, assume the 
Pitman asymptotic relative efficiency (ARE) of Zj relative to Zi is e{Zj, Zi) = P% 
for I, j G /. These conditions are satisfied in many applications, (van Eeden [2l| 
and Gross [13|) 
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2.1. Maximin efficiency robust tests 



When the true model is unknown and each model in the family is scientifically 
plausible, the minimum ARE compared to the optimum test for each model, Zi, 
when Zj is used is given by infig/ e{Zj, Zi) for j S /. One robust test is to choose 
the optimal test Zi from the family {Zi : i E 1} which maximizes the minimum 
ARE, that is, 

(2.1) inf e(Z;, Zi) = sup inf e{Zj, Zi). 

Under the null hypothesis, Zi converges in distribution to a standard normal random 
variable and, under the definition (j2.ip . is the most robust test in {Zi : i e /}. In 
practice, however, other tests have been studied which may have greater efficiency 
robustness. 

Although a family of models are proposed based on scientific knowledge and the 
corresponding optimal tests can be obtained, all consistent tests with an asymptot- 
ically normal distribution can be used. Denote all these tests for the problem by C. 
The original family of test statistics can be expanded to C. The purpose is to find 
a test Z from C, rather than from the original family {Zi : i G /}, such that 

(2.2) Me{Z,Z,) = snp Me{Z,Zi). 

The test Z satisfying (|2.2p is called maximin efficiency robust test (MERT). (Gast- 
wirth ff\) Whe n the family C is restricted to the convex linear combinations of 
{Zi : i £ I}, the resulting robust test is denoted as Zmert- Since {Zi : i € 1} C C, 

sup inf e(Z, Zi) > sup inf e{Zj, Zi). 
z&c^^"^ jei 

Assuming that inf^ ^g/ pij > e > 0, Gastwirth Q proved that Zmert uniquely 
exists and can be written as a closed convex combination of optimal tests Zi in 
the family {Zi : i £ i}. Although a simple algorithm when C is the class of linear 
combination of {Zi : i E 1} was given in Gastwirth (see also Zucker and Lakatos 
prf). the computation of Zmekt is more complicated as it is related to quadratic 
programming algorithms. (Rosen [l^) For many applications, ^mert can be easily 
written as a linear convex combination of two or three optimal tests in {Z^ : i e /} 
including the extreme pair defined as follows: two optimal tests Zs,Zt £ {Zi : i G /} 
are called extreme pair if pst = corr Hq{Zs, Zt) — inf^ jg/ pij > 0. Define a new test 
statistic Zgt based on the extreme pair as 

V I 

(2.3) Z, 



[2{l + Pst)V/' 



which is the MERT for the extreme pair. A necessary and sufficient condition for 
Zst to be Zmekt for the whole family {Zi : i G /} is given, see Gastwirth 8], by 

(2.4) psi + Pit>l + Pst, for aU i G /. 

Under the nuU hypothesis, Zmekt is asymptotically A^(0, 1). The ARE of the MERT 
given by (|2.4p is (1 + psf)/2. To find the MERT, the null correlations pij need to 
be obtained and the pair is the extreme pair for which pij is smallest. 
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2.2. Maximum tests 

The robust test Zmert is a linear combination of the optimal test statistics and 
with modern computers it is useful to extend the family C of possible tests to 
include non- linear functions of the Zi. A natural non- linear robust statistic is the 
maximum over the extreme pair {Zs,Zt) or the triple (Zg, Z^, Zt) for the entire 
family (Freidlin et al. [5|), i.e., 

Zmax2 = max(Z5, Zt) or Zmaxs = max(Zs, Z„, Zt). 

There are several choices for Z„ in Zmax3, e.g., = Zst (MERT for the extreme 
pair or entire family). As when obtaining the MERT, the correlation matrix {pij} 
guides the choice of Z„ to be used in MAX3, e.g., it has equal correlation with the 
extreme tests. A more complicated maximum test statistic is to take the maximum 
over the entire family Zmax = niaxig/ Zi or Zmax = niax^gc' Zi. Zmax was con- 
sidered by Davies 0] for some non-standard hypothesis testing whose critical value 
has to be determined by approximation of its upper bound. In a recent study of 
several applications in genetic association and linkage analysis, Zheng and Chen 
[2^ showed that Zmax3 and Zmax have similar power performance in these ap- 
plications. Moreover, Zuax2 or Zmax3 are much easier to compute than Zmax- 
Hence, in the next section, we only consider the two maximum tests Zmax2 and 
Zmax3- The critical values for the maximum test statistics can be found by simula- 
tion under the null hypothesis as any two or three optimal statistics in {Zi : i G 1} 
follow multivariate normal distributions with correlation matrix {pij}. For exam- 
ple, given the data, pst can be calculated. Generating a bivariate normal random 
variable {Z^j, Ztj) with the correlation matrix {pst} for j = 1, . . . , B. For each j, 
Zmax2 is obtained. Then an empirical distribution for Zmax2 can be obtained 
using these B simulated maximum statistics, from which we can find the critical 
values. In some applications, if the null hypothesis does not depend on any nuisance 
parameters, the distribution of Zmax2 or Zuax3 can be simulated exactly without 
the correlation matrix, e.g., Zheng and Chen |24| . 

2.3. Comparison of MERT and MAX 

Usually, ^MERT is easier to compute and use than Zuax2 (or ^maxs)- Intuitively, 
however, Zmax3 should have greater efficiency robustness than Zmekt when the 
range of models is wide. The selection of the robust test depends on the minimum 
correlation pst of the entire family of optimal tests. Results from Freidlin et al. [HI 
showed that when pst > 0.75, MERT and MAX2 (MAX3) have similar power; thus, 
the simpler MERT can be used. For example, when pst = 0.75, the ARE of MERT 
relative to the optimal test for any model in the family is at least 0.875. When 
Pst < 0.50, MAX2 (MAX3) is noticeably more powerful than the simple MERT. 
Hence, MAX2 (MAX3) is recommended. For example, in genetic linkage analysis 
using affected sib pairs, the minimum correlation is greater than 0.8. and the MERT, 
MAX2, and MAX3 have similar power. (Whittemore and Tu [22| and Gastwirth 
and Freidlin [l3|) For analysis of case-parents data in genetic association studies 
where the mode of inheritance can range from pure recessive to pure dominant, 
the minimum correlation is less than 0.33, and then the MAX3 has desirable power 
robustness for this problem. (Zheng et al. ^2^) 
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3. Genetic association using case-control studies 
3. 1 . Background 

It is well known that association studies testing linkage disequilibrium are more 
powerful than linkage analysis to detect small genetic effects on traits. (Risch and 
Merikangas [17]) Moreover association studies using cases and controls are easier 
to conduct as parental genotypes are not required. 

Assume that cases are sampled from the study population and that controls 
are independently sampled from the general population without disease. Cases and 
controls are not matched. Each individual is genotyped with one of three genotypes 
MM, MN and NN for a marker with two alleles M and N. The data obtained 
in case-control studies can be displayed as in Table [1] (genotype-based) or as in 
Tabic El (allele-based). 

Define three penetrances as /o = Pr(case| A^A^), /i = Pr(case|A^M), and /2 = 
Pr(case|MM), which are the disease probabilities given different genotypes. The 
prevalence of disease is denoted as D — Pr(case). The probabilities for genotypes 
{NN, NM, MM) in cases and controls are denoted by {po,pi,p2) and (go, (Zi, 92), 
respectively. The probabilities for genotypes (A^A^, NM, MM) in the general pop- 
ulation are denoted as {go, gi, 92)- The following relations can be obtained. 

(3.1) p^ = — and = for i = 0,l, 2. 

Note that, in Table 1, (ro,ri,r2) and {sq, 31,82) follow multinomial distributions 
mul{R;pQ,pi,p2) and mul{S;qo,qi,q2), respectively. Under the null hypothesis of 
no association between the disease and the marker, pi = qi — gi for i = 0,1,2. 
Hence, from p.ip . the null hypothesis for Table [T] is equivalent to -ffo • /o = /i = 
/2 = D. Under the alternative, penetrances are different as one of two alleles is a 
risk allele, say, M. In genetic analysis, three genetic models (mode of inheritance) 
are often used. A model is recessive (rec) when /q = /i, additive (add) when fi = 
(/o + /2)/2, and dominant (dom) when fi — /2. For recessive and dominant models, 
the number of columns in Table [1] can be reduced. Indeed, the columns with A^A^ 
and NM {NM and MM) can be collapsed for recessive (dominant) model. Testing 
association using Table [2] is simpler but Sasieni [l^ showed that genotype based 
analysis is preferable unless cases and controls are in Hardy- Weinberg Equilibrium. 







Table 1 






Genotype distribution for case-control studies 




NN 


NM 


MM 


Total 


Case 


ro 


ri 


r2 
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Control 


so 


Sl 


S2 


s 


Total 


no 




"2 


n 



Table 2 

Allele distribution for case-control studies 

N M Total 

Case 2ro + ri ri -\- 2r2 2r 

Control 2so + si s\ 2s2 2s 

Total 2no + rti ni + 2n2 2n 
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3.2. Test statistics 



For the 2x3 table (Table [H), a chi-squared test with 2 degrees of freedom (df) can 
be used. (Gibson and Muse 'llll) This test is independent of the underlying genetic 
model. Note that, under the alternative when M is the risk allele, the penetrances 
have a natural order: fa < fi < /2 (at least one inequality hold). The Cochran- 
Armitage (CA) trend test (Cochran [3] and Armitage flj) taking into account the 
natural order should be more powerful than the chi-squared test as the trend test 
has 1 df. 

The CA trend test can be obtained as a score test under the logistic regression 
model with genotype as a covariate, which is coded using scores x — [xq, a;i, 2:1) for 
(NN,NM,MM), where xq < xi < X2- The trend test can be written as (Sasieni 

m 



n 



1/2 ■ 



Since the trend test is invariant to linear transformations of x, without loss of 
generality, we use the scores x = {0,x, 1) with < a; < 1 and denote as Z^- 
Under the null hypothesis, has an asymptotic normal distribution N{0, 1). When 
M is a risk allele, a one-sided test is used. Otherwise, a two-sided test should be used. 
Results from Sasieni [3] and Zheng, Freidlin, Li and Gastwirth [5^ showed that the 
optimal choices of x for recessive, additive and dominant models are x = 0, x = 1/2, 
and a; = 1, respectively. That is, Zq, Z1/2 or Zi is an asymptotically most powerful 
test when the genetic model is recessive, additive or dominant. The tests using 
other values of x are optimal for penetrances in the range < /o < /i < /2 < 1. 

For complex diseases, the genetic model is not known a priori. The optimal 
test Zx cannot be used directly as a substantial loss of power may occur when x 
is misspecified. Applying the robust procedures introduced in Section 2, we have 
three genetic models and the collection of all consistent tests C = {Z^ : x G [0, 1]}. 
To find a robust test, we need to evaluate the null correlations. Denote these as 
corr Ha{Zxn Z,j.^ — Pj.^ j.^. From appendix C of Freidlin, Zheng, Li and Gastwirth 

_ Po(pi + 2p2) 

" {po(l -Po)}i/2{(pi + 2p2)P0 + {Pi + 2po)P2}i/2' 
PaP2 



Po,i = 
Pi/2,1 = 



{Ml-P0)}l/2{p2(l-p2)}l/2' 

P2{pi + 2po) 



{P2{1 - P2)V/H{P1 + 2p2)po + [Pl + 2po)P2}l/2' 



Although the null correlations are functions of the unknown parameters pi,i — 
0,1,2, it can be shown analytically that poA < Pa.1/2 and po.i < Pi/2,i- Note 
that if the above analytical results were not available, the pi would be estimated 
by substituting the observed data pt = Ui/n for pi. Here the minimum correla- 
tion among the three optimal tests occurs when Zq and Zi is the extreme pair 
for the three genetic models. Freidlin et al. also proved analytically that the 
condition {Ml) holds. Hence, Zmert = [Zq + ^i)/{2(l + Pq,i)V^^ is the MERT 
for the whole family C, where yOo,i is obtained when the pi are replaced by rii/n. 
The two maximum tests can be written as Zmax2 = max(2'o,2'i) and Zmax3 = 
iiiax.{Zo, Zi/2, Zi). When the risk allele is unknown, Zmax2 — maxdZgl, \ Zi\) and 
Zmax3 — niax(|Zo|, |^i/2|, \Zi\). Although we considered three genetic models, the 
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family of genetic models for case-control studies can be extended by defining a ge- 
netic model as penetrances restricted to the family {(/o,/i,/2) : /o < /i < /2}- 
Three genetic models are contained in this family as the two boundaries and one 
middle ray of this family. The statistics Zmert and Zmaxs are also the correspond- 
ing robust statistics for this larger family (see, e.g., Freidlin et al. Q and Zheng et 
al. 0). 

In analysis of case-control data for genetic association, two other tests are also 
currently used. However, their robustness and efficiency properties have not been 
compared to MERT and MAX. The first one is the chi-squared test for the 2x3 
contingency table (Table [1]), denoted as xi- (Gibson and Muse [lH) Under the null 
hypothesis, it has a chi-squared distribution with 2 df. The second test, denoted as 
Zp, is based on the product of two different tests: (a) the allele association (AA) 
test and (b) the Hardy- Weinberg disequilibrium (HWD) test. (Hoh, Wile and Ott 
[lit and Song and Elston [20] ) The AA test is a chi-squares test for the 2x2 table 
given in Table [2l which is written as 

2 _ 2n[(2ro + n){si + 2s2) - (2so + s,){n + 2r2)]2 
Xaa — 



4rs(2no + + 2^2) 

The HWD test detects the deviation from Hardy- Weinberg equilibrium (HWE) in 
cases. Assume the allele frequency of M is p = Pr(M). Using cases, the estimation 
oi p is p = (ri -I- 2r2)/(2r). Let q = 1 — p be the estimation of allele frequency for 
N. Under the null hypothesis of HWE, the expected number of genotypes can be 
written as E{NN) = rg^, E(7VM) = r2pq and E(MAf) = rp^, respectively. Hence, 
a chi-squared test for HWE is 

2 _ (ro - ^{NN)f ^ (ri - E{NM)f ^ (r2 - E(MAf))2 
Xhwd — 



E(iVA^) E(iVAf) E(MAf) 

The product test, proposed by Hoh et al. [1^1 , is Tp = Xaa ^ Xhwd- They noticed 
that the power performances of these two statistics are complementary. Thus, the 
product should retain reasonable power as one of the tests has high power when the 
other does not. Consequently, for a comprehensive comparison, we also consider the 
maximum of them, Tmax = niax(x^^, Xhwd)- Given the data, the critical values 
of Tp and Tmax can be obtained by a permutation procedure as their asymptotic 
distributions are not available. (Hoh et al. [3]) Note that Tp was originally proposed 
by Hoh et al. [3] as a test statistic for multiple gene selection and was modified by 
Song and Elston [2^ for use as a test statistic for a single gene. 



3.3. Power comparison 

We conducted a simulation study to compare the power performance of the test 
statistics. The test statistics were (a) the optimal trend tests for the three genetic 
models, Zq, Zxji and Zi, (b) MERT Zmert, (c) maximum tests .^max2 and Zmaxs, 
(d) the product test Tp, (e) Tmax, and (f) 

In the simulation a two-sided test was used. We assumed that the allele frequency 
p and the baseline penetrance /o are known (/p = .01). Note that, in practice, the al- 
lele frequency and penetrances are unknown. However, they can be estimated empir- 



ically (e.g.. Song and Elston (20[ and Wittke-Thompson, Pluzhnikov and Cox [23|). 
In our simulation the critical values for all test statistics are simulated under the null 
hypothesis. Thus, we avoid using asymptotic distributions for the test statistics. The 
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Type I errors for all tests are expected to be close to the nominal level a — 0.05 
and the powers of all tests are comparable. When HWE holds, the probabilities 
{po,Pi,P2) for cases and (go, 11,12) for controls can be calculated using (|3.1|) under 
the null and alternative hypotheses, where {go, 51, 52) = (<7^j 2pg, q^) and (/i, /2) are 
specified by the null or alternative hypotheses and D = figi . After calculating 
{po,Pi,P2) and {qo, qi, (72) under the null hypothesis, we first simulated the genotype 
distributions {ro,ri,r2) ~ mul{R;po,pi,p2) and {sq, 81,82) ^ ■mul{S;qo, 11,12) for 
cases and controls, respectively (see Table [ij. When HWE does not hold, we as- 
sumed a mixture of two populations with two different allele frequencies pi and p2. 
Hence, we simulated two independent samples with different allele frequencies for 
cases (and controls) and combined these two samples for cases (and for controls). 
Thus, cases (controls) contain samples from a mixture of two populations with dif- 
ferent allele frequencies. When p is small, some counts can be zero. Therefore, we 
added 1/2 to the count of each genotype in cases and controls in all simulations. 

To obtain the critical values, a simulation under the null hypothesis was done 
with 200,000 replicates. For each replicate, wc calculated the test statistics. For each 
test statistic, we used its empirical distribution function based on 200,000 replicates 
to calculate the critical value for a = 0.05. The alternatives were chosen so that 
the power of the optimal test Zq, Zi/2, Z\ was near 80% for the recessive, additive, 
dominant models, respectively. To determine the empirical power, 10,000 replicates 
were simulated using multinomial distributions with the above probabilities. 

To calculate .^mert, the correlation po,! was estimated using the simulated data. 
In Table 3, we present the mean of correlation matrix using 10,000 replicates when 
r = 8 — 250. The three correlations /3o,i/2; Po,i: Pi/2,1 were estimated by replacing 
Pi with ni/n,i = 0, 1,2 using the data simulated under the null and alternatives 
and various models. The null and alternative hypotheses used in Table [3] were also 
used to simulate critical values and powers (Table S]). Note that the minimum 
correlation po,i is less than .50. Hence, the ZyiAxs should have greater efficiency 
robustness than 2^mert- However, when the dominant model can be eliminated 
based on prior scientific knowledge (e.g. the disease often skips generations), the 
correlation between Zq and Z1/2 optimal for the recessive and additive models would 
be greater than .75. Thus, for these two models, Zmert should have comparable 
power to Zmax2 — nrax(|Zo|, |.^i/2|) and is easier to use. 

The correlation matrices used in Table |4] for r ^ s and Table [5] for mixed sam- 
ples are not presented as they did not differ very much from those given in Table [31 
Tables |4] and [5] present simulation results where all three genetic models are plau- 
sible. When HWE holds Table [4] shows that the Type I error is indeed close to the 
a = 0.05 level. Since the model is not known, the minimum power across three 
genetic models is written in bold type. A test with the maximum of the minimum 
power among all test statistic has the most power robustness. Our comparison fo- 



Table 3 

The mean correlation matrices of three optimal test statistics based on 10,000 replicates 

when HWE holds 



Model 










■p 












.1 






.3 






.5 




Po,l/2 


P0,1 


Pi/2,1 


P0,l/2 


P0,1 


Pi/2,1 


P0,l/2 


Po,i 


Pi/2,1 


null 


.97 


.22 


.45 


.91 


.31 


.68 


.82 


.33 


.82 


rcc 


.95 


.34 


.63 


.89 


.36 


.74 


.81 


.37 


.84 


add 


.96 


.23 


.48 


.89 


.32 


.71 


.79 


.33 


.84 


dom 


.97 


.21 


.44 


.90 


.29 


.69 


.79 


.30 


.83 



The same models (rec,add,dom) are used in Table 4 when r = s = 250. 
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Table 4 

Power comparison when HWE holds in cases and controls under three genetic models 

with a = .05 



Test statistics 



p 


Model 


Zo 


Zl/2 


Zi 


^MERT 


ZmAX2 


Zmaxs 


Tmax 


Tp 


xi 












r = 250, s 


= 250 










.1 


null 


.058 


.048 


.050 


.048 


.047 


.049 


.049 


.049 


.053 




rec 


.813 


.364 


.138 


.606 


.725 


.732 


.941 


.862 


.692 




add 


.223 


.813 


.802 


.733 


.782 


.800 


.705 


.424 


.752 




dom 


.108 


.796 


.813 


.635 


.786 


.795 


.676 


.556 


.763 


.3 


null 


.051 


.052 


.051 


.054 


.052 


.052 


.053 


.049 


.050 




rec 


.793 


.537 


.178 


.623 


.714 


.726 


.833 


.846 


.691 




add 


.433 


.812 


.768 


.786 


.742 


.773 


.735 


.447 


.733 




dom 


.133 


.717 


.809 


.621 


.737 


.746 


.722 


.750 


.719 


.5 


null 


.049 


.047 


.051 


.047 


.047 


.047 


.050 


.051 


.050 




rec 


.810 


.662 


.177 


.644 


.738 


.738 


.772 


.813 


.709 




add 


.575 


.802 


.684 


.807 


.729 


.760 


.719 


.450 


.714 




dom 


.131 


.574 


.787 


.597 


.714 


.713 


.747 


.802 


.698 












r = 50, s 


= 250 










.1 


null 


.035 


.052 


.049 


.052 


.051 


.053 


.045 


.051 


.052 




rec 


.826 


.553 


.230 


.757 


.797 


.802 


.779 


.823 


.803 




add 


.250 


.859 


.842 


.773 


.784 


.795 


.718 


.423 


.789 




dom 


.114 


.807 


.814 


.658 


.730 


.734 


.636 


.447 


.727 


.3 


null 


.048 


.048 


.048 


.048 


.050 


.050 


.050 


.048 


.049 




rec 


.836 


.616 


.190 


.715 


.787 


.787 


.733 


.752 


.771 




add 


.507 


.844 


.794 


.821 


.786 


.813 


.789 


.500 


.778 




dom 


.171 


.728 


.812 


.633 


.746 


.749 


.696 


.684 


.729 


.5 


null 


.049 


.046 


.046 


.046 


.047 


.047 


.046 


.049 


.046 




rec 


.838 


.692 


.150 


.682 


.771 


.765 


.705 


.676 


.748 




add 


.615 


.818 


.697 


.820 


.744 


.780 


.743 


.493 


.728 




dom 


.151 


.556 


.799 


.565 


.710 


.708 


.662 


.746 


.684 



cuses on the test statistics: Zmaxs, Tmax, 7p and X2- O^'' results show that Tmax 
has greater efRciency robustness than Tp while .^max3, Tmax and xi have similar 
minimum powers. Notice that .^maxs is preferable to X2 although the difference 
in minimum powers depends on the allele frequency. When HWE does not hold, 
■^MAX3 still possesses its efficiency robustness, but Tmax and Tp do not perform 
as well. Thus, population stratification affects their performance. 2^max3 also re- 
mains more robust than X2 even when HWE does not hold. From both Table |4] and 
Table [SJ X2 raore powerful than Zueki except for the additive model. However, 
when the genetic model is known, the corresponding optimal CA trend test is more 
powerful than X2 with 2 df. 

From Tables 2] and [51 one sees that the robust test Zmax3 tends to be more 
powerful than X2 under the various scenarios we simulated. Further comparisons of 
these two test statistics using p- values and the same data given in Table S] (when 
r — s — 250) are reported in Table [51 Following Zheng et al. [2^, which reported 
a matched study, where both tests are applied to the same data, the p-values for 
each test are grouped as < .01, (.01, .05), (.05, .10), and > .10. Cross classification 
of the p- values are given in Table 6 for allele frequencies p — .1 and .3 under all 
three genetic models. Table [6] is consistent with the results of Tables [4] and [5l i.e., 
■Z^MAxa is more powerful than the chi-squared test with 2 degrees of freedom when 
the genetic model is unknown. This is seen by comparing the counts at the upper 
right corner with the counts at lower left corner. When the counts at the upper 
right corner are greater than the corresponding counts at the lower left corner, 
■^MAX3 usually has smaller p- values than xi- In particular, we compare two tests 
with p-values < .01 versus p-values in (.01, .05). Notice that in most situations 
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Table 5 

Power comparison when HWE does not hold in cases and controls under three genetic models 
(Mixed samples with different allele frequencies (pi,P2) ci'i^d sample sizes {Ri,Si) and {R2,S2) 









with r - 


= Ri + 


R2, s = Si + 


S2, and a - 


= .05;. 


















Test statistics 










(P1.P2) 


Model 


Zo 


^1/2 


Zi 


^MERT ' 


^MAX2 Z 


MAX3 


Tmax 




X2 








Ri = 


250, Si : 


= 250 and R2 


= 100, S2 - 


= 100 








(.1,.4) 


null 


.047 


.049 


.046 


.050 


.047 


.046 


.048 


.046 


.048 




rec 


.805 


.519 


.135 


.641 


.747 


.744 


.936 


.868 


.723 




add 


.361 


.817 


.771 


.776 


.737 


.757 


.122 


.490 


.688 




dom 


.098 


.715 


.805 


.586 


.723 


.724 


.049 


.097 


.681 


(.1,.5) 


null 


.047 


.050 


.046 


.048 


.046 


.046 


.052 


.050 


.052 




rcc 


.797 


.537 


.121 


.620 


.746 


.750 


.920 


.839 


.724 




add 


.383 


.794 


.732 


.764 


.681 


.709 


.033 


.620 


.631 




dom 


.095 


.695 


.812 


.581 


.697 


.703 


.001 


.133 


.670 


(.2,.5) 


null 


.048 


.052 


.052 


.054 


.052 


.052 


.050 


.047 


.050 




rcc 


.816 


.576 


.157 


.647 


.760 


.749 


.889 


.881 


.725 




add 


.417 


.802 


.754 


.782 


.726 


.743 


.265 


.365 


.684 




dom 


.112 


.679 


.812 


.600 


.729 


.715 


.137 


.122 


.696 








Ri = 


: 30, Si : 


= 150 and R2 


= 20, S2 = 


100 








(.1,.4) 


null 


.046 


.048 


.048 


.046 


.048 


.046 


.055 


.045 


.048 




rec 


.847 


.603 


.163 


.720 


.807 


.799 


.768 


.843 


.779 




add 


.387 


.798 


.762 


.749 


.725 


.746 


.449 


.309 


.691 




dom 


.139 


.733 


.810 


.603 


.721 


.728 


.345 


.223 


.699 


(.1,.5) 


null 


.053 


.055 


.050 


.053 


.053 


.053 


.059 


.047 


.054 




rec 


.816 


.603 


.139 


.688 


.776 


.780 


.697 


.827 


.741 




add 


.415 


.839 


.797 


.798 


.763 


.781 


.276 


.358 


.703 




dom 


.120 


.726 


.845 


.612 


.750 


.752 


.149 


.133 


.716 


(.2,.5) 


null 


.047 


.048 


.050 


.051 


.048 


.048 


.049 


.050 


.044 




rec 


.858 


.647 


.167 


.722 


.808 


.804 


.768 


.852 


.783 




add 


.472 


.839 


.790 


.811 


.776 


.799 


.625 


.325 


.740 




dom 


.139 


.708 


.815 


.614 


.741 


.743 


.442 


.390 


.708 



the number of times xi ^ p-value < .01 and 2^max3 has a p-value in (.01, .05) 
is much less than the corresponding number of times when .^maxs has a p-value 
< .01 and X2 has a p-value in (.01, .05). For example, when p — .2> and the additive 
model holds, there are 289 simulated datasets where has a p-value in (.01, .05) 
while .^MAX3 has a p-value < .01 versus only 14 such datasets when Zmaxs has 
a p-value in (.01, .05) while has a p-value < .01. The only exception occurs at 
the recessive model under which they have similar counts (165 vs. 140). Combining 
results from Tables 4 and 6, Zmax3 is more powerful than x|, but the difference of 
power between 2'max3 and is usually less than 5% in the simulation. Hence xl 
is also an efficiency robust test, which is very useful for genome- wide association 
studies, where hundreds of thousands of tests are performed. 

From prior studies of family pedigrees one may know whether the disease skips 
generations or not. If it does, the disease is less likely to follow a pure-dominant 
model. Thus, when genetic evidence strongly suggests that the underlying genetic 
model is between the recessive and additive inclusive, we compared the performance 
of tests Zm-ekt = [Zo + ^i/2)/{2(l -I- Po,i/2)V^'^, ^max2 = max(Zo, Z1/2), and xi- 
The results are presented in Table [7l The alternatives used in Table [4] for rec and 
add with r = s = 250 were also used to obtain Table \7\ For a family with the 
recessive and additive models, the minimum correlation is increased compared to 
the family with three genetic models (rec, add and dom). For example, from Table[31 
the minimum correlation with the family of three models that ranges from .21 to 
.37 is increased to the range of .79 to .97 with only two models. From Table [7l 
under the recessive and additive models, while Zmax2 remains more powerful than 



Robust tests for genetic association 



263 



Table 6 

Matched p-value comparison of Zmax3 '^'^'^ X2 when HWE holds 
in cases and controls under three genetic models 
(Sample sizes r = s = 250 and 5,000 replications) 



p 


Model 


^MAX3 


< .01 


.01 - .05 


.05 - .10 


> .10 


.10 


rcc 


< .01 


2069 


165 












.01 - .05 


140 


1008 


251 









.05 - .10 


7 


52 


227 


203 






> .10 


1 


25 


47 


805 




add 


< .01 


2658 


295 












.01 - .05 


44 


776 


212 









.05 - .10 


3 


23 


159 


198 






> .10 





5 


16 


611 




dom 


< .01 


2785 


214 












.01 - .05 


80 


712 


214 









.05 - .10 


10 


42 


130 


169 






> .10 


1 


14 


27 


602 


.30 


rec 


< .01 


2159 


220 












.01 - .05 


85 


880 


211 









.05 - .10 


6 


44 


260 


157 






> .10 


2 


33 


40 


903 




add 


< .01 


2485 


289 












.01 - .05 


14 


849 


229 









.05 - .10 


1 


8 


212 


215 






> .10 





1 


6 


691 




dom 


< .01 


2291 


226 












.01 - .05 


90 


894 


204 









.05 - .10 


7 


52 


235 


160 






> .10 





26 


49 


766 



Table 7 

Power comparison when HWE holds in cases and controls assuming two genetic models 
(rec and add) based on 10,000 replicates (r = s = 250 and fo = .01^ 



Model 










P 












.1 






.3 






.5 




.^MERT 




xi 


■^MERT 




xi 


^MERT 




xi 


null 


.046 


.042 


.048 


.052 


.052 


.052 


.053 


.053 


.053 


rcc 


.738 


.729 


.703 


.743 


.732 


.687 


.768 


.766 


.702 


add 


.681 


.778 


.778 


.714 


.755 


.726 


.752 


.764 


.728 


other^ 


.617 


.830 


.836 


.637 


.844 


.901 


.378 


.547 


.734 


other-^ 


.513 


.675 


.677 


.632 


.774 


.803 


.489 


.581 


.652 


other^ 


.385 


.465 


.455 


.616 


.672 


.655 


.617 


.641 


.606 



other^ is dominant and the other two are semi-dominant, all with /2 = .019. 
other fi = .019. other^; /i = .017. other^: /i = .015. 



xi and 2^mert, the difference in minimum power is much less than in the previous 
simulation study (Table |4|). Indeed, when studying complex common diseases where 
the allele frequency is thought to be fairly high, Zmax2 and Zmert have similar 
power. Thus, when a genetic model is between the recessive and additive models 
inclusive, MAX2 and MERT should be used. In Table[7l some other models were also 
included in simulations when we do not have sound genetic knowledge to eliminate 
the dominant model. In this case, MAX2 and MERT lose some efficiency compared 
to xi- However, MAX3 still has greater efficiency robustness than other tests. In 
particular, MAX3 is more powerful than xi (^lot reported) as in Table [D Thus, 
MAX3 should be used when prior genetic studies do not justify excluding one of 
the basic three models. 
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4. Discussion 

In this article, we review robust procedures for testing hypothesis when the underly- 
ing model is unknown. The implementation of these robust procedures is illustrated 
by applying them to testing genetic association in case-control studies. Simulation 
studies demonstrated the usefulness of these robust procedures when the underlying 
genetic model is unknown. 

When the genetic model is known (e.g., recessive, dominant or additive model), 
the optimal Cochran- Armitage trend test with the appropriate choice of x is more 
powerful than the chi-squared test with 2 df for testing an association. The genetic 
model is usually not known for complex diseases. In this situation, the maximum of 
three optimal tests (including the two extreme tests), Zmaxs, is shown to be efficient 
robust compared to other available tests. In particular, Zmax3 is slightly more 
powerful than the chi-squared test with 2 df. Based on prior scientific knowledge, 
if the dominant model can be eliminated, then MERT, the maximum test, and the 
chi-squared test have roughly comparable power for a genetic model that ranges 
from recessive model to additive model and the allele frequency is not small. In this 
situation, the MERT and the chi-squared test are easier to apply than the maximum 
test and can be used by researchers. Otherwise, with current computational tools, 
Zmax3 is recommended. 
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